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Abstract 

In this note, we propose two different approaches to rigorously justify a pseudo- 
Markov property for controlled diffusion processes which is often (explicitly or im¬ 
plicitly) used to prove the dynamic programming principle in the stochastic control 
literature. The first approach develops a sketch of proof proposed by Fleming and 
Souganidis [9]. The second approach is based on an enlargement of the original 
state space and a controlled martingale problem. We clarify some measurability 
and topological issues raised by these two approaches. 
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1 Introduction 

The dynamic programming principle (DPP) is a key step in the mathematical analy¬ 
sis of optimal stochastic control problems. For various formulations and approaches, 
among hundreds of references, see, e.g., Fleming and Rishel [ 7 ], Krylov |13) . El 
Karoui [5], Borkar [T], Fleming and Soner |8], Yong and Zhou |22) . and the more 
recent monographs by Pham m or Touzi [21]. 

The DPP has a very intuitive meaning but its rigorous proof for controlled dif¬ 
fusion processes is a difficult issue in all cases where the set of admissible controls is 
the set of all progressively measurable processes taking values in a given domain. In 
particular this occurs in the stochastic analysis of Hamilton-Jacobi-Bellman equa¬ 
tions. The highly technical approach to the DPP developed by Krylov in [13] con¬ 
sists in establishing a continuity property of expected cost functions w.r.t. controls 
(see m Cor.3.2.8]) in order to be in a position to only prove the DPP in the simpler 
case where the controls are piecewise constant. When the controls belong to this 
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restricted set, the DPP is obtained by proving that the corresponding controlled 
diffusions enjoy a pseudo-Markov property which easily results from the classical 
Markov property enjoyed by uncontrolled diffusions. See [131 Lem.3.2.14]. 

In their context of stochastic differential games problems, Fleming and Sougani- 
dis [HI Lem. 1.11] propose to avoid the control approximation procedure by estab¬ 
lishing the pseudo-Markov property without restriction on the controls. Although 
natural, this way to establish the DPP leads to various difficulties. In particular, 
one needs to restrict the state space to the standard canonical space. Unfortunately 
the proof of the crucial lemma 1.11 is only sketched. 

Many other authors had implicitly or explicitly followed the same way and par¬ 
tially justified the pseudo-Markov property in the case of general admissible controls: 
see, e.g., Tang and Yong [201 Lem.2.2], Yong and Zhou [221 Lem.4.3.2], Bouchard 
and Touzi [2] and Nutz M- However, to the best of our knowledge, there is no full 
and precise proof available in the literature except in restricted situations (see, e.g., 
Kabanov and Kliippelberg |10|L 

When completing the elements of proof provided by Fleming and Souganidis 
and the other above mentioned authors, we found several subtle technical difficul¬ 
ties to overcome. For example, the stochastic integrals and solutions to stochastic 
differential equations are usually defined on a complete probability space equipped 
with a filtration satisfying the usual conditions; on the other hand, the completion 
of the canonical cr-field is undesirable when one needs to use a family of regular 
conditional probabilities. See more discussion in Section 12.31 

We therefore find it useful to provide a precise formulation and a detailed full jus¬ 
tification of the pseudo-Markov property for general controlled diffusion processes, 
and to clarify measurability and topological issues, particularly the importance of 
setting stochastic control problems in the canonical space rather to an abstract non 
Polish probability space if the pseudo-Markov property is used to prove the DPP. 

We provide two proofs. The first one develops the arguments sketched by Flem¬ 
ing and Souganidis [9]. The second one is simpler in some aspects but requires some 
more heavy notions; it is based on an enlargement of the original state space and a 
suitable controlled martingale problem. 

The rest of the paper is organized as follows. In Section [21 we introduce the 
classical formulation of controlled SDEs. We then precisely state the pseudo-Markov 
property for controlled diffusions, which is our main result. We discuss its use 
to establish the DPP. To prepare its two proofs, technical lemmas are proven in 
Section 131 Finally, in Sections U] and O we provide our two proofs. 

Notations. We denote by D := the canonical space of continuous func¬ 

tions from M"*" to which is a Polish space under the locally uniform convergence 
metric. B denotes the canonical process and F = [Bg-, s > 0) denotes the canoni¬ 
cal filtration. The Borel cr-field of D is denoted by B and coincides with \/^>q 
W e denote by P the Wiener measure on (D, under which the canonical process 
B is a F-Brownian motion, and by the collection of all P-negligible sets of 
J-”, i.e. all sets A included in some N ^ B satisfying ¥{N) = 0. We denote by 
F'^ = {J-f, s > 0) the P-augmented filtration, that is, J-f := Fg V . We also set 

Finally, for all x D, the stopped path of uj at time t is denoted by 

[uj\t := (w(t A s), s > 0). 
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2 A pseudo-Markov property for controlled dif¬ 
fusion processes 


2.1 Controlled stochastic differential equations 

Let [/ be a Polish space and be the collection of all square matrices of order d. 

Consider two Borel measurable functions b : M'*' x 0 x [/ —>■ and a : x 0 x 
U —>■ Sd^d satisfying the following condition: for all u £ U, the processes b{-,-,u) 
and cj(-, ■,u) are F-progressively measurable or, equivalently, (t,x) i—>■ b{t,x,u) and 
(t,x) I—?■ a{t,x,u) are ( 8 ) J^-measurable and 

b{t,x,u) = b{t,[x\t,u), a{t,x,u) = a{t,[x]t,u), V(f,x)GR^xO 

(see Proposition 13.41 below!. In addition, we suppose that there exists C > 0 such 
that, for all {t,x,y,u) € x xU, 


'\b{t,x,u)-b{t,y,u)\ +\\a{t,x,u) - a{t,y,u)\\ < C sup |x(s)-y(s)|, 

0<s<t 

< / 

sup (|6(t,x,ri)| + ||(T(t,x,ri)||) < C ( 1 + sup |x(s)| 

\ 0<s<£ 

Denote by U the collection of all [/-valued F-predictable (or progressively mea¬ 
surable, see Proposition Em below) processes. Then, given a control v £lA, consider 
the controlled stochastic differential equation (SDE) 



dXs = b{s,X,Vs)ds + a{s, X,i's)dBs. (2) 

As B is still a Brownian motion on (D, F^) (see, e.g., |12l Thm.2.7.9]) we may 

and do consider ([2l) on this hltered probability space. 

Under Condition ([T]), for all control v £lA and initial condition (t,x) G M'*' x D, 
there exists a unique (up to indistinguishability) continuous and F'^-progressively 
measurable process in (D, such that, for all 9 in M'*', 

ptye 

Xg^’’^ = x(tA9)+ / b{s, ,iys)ds + / a{s, ,iys)dBs, P — a.s. 

Jt Jt 

( 3 ) 

Our two proofs of a pseudo-Markov property for use an identity in law 

which makes it necessary to reformulate the controlled SDE ([2]) as a standard SDE. 

Given (t, i/) G M'*' xlA, we dehne x —>■ and x —)> S 2 d,d 

as follows : for all s G u = G 




CO : = 


b{s,ui',i'siuj)%>t 


d^’''{s,cb) := 


Ud 


Then, given x in D, ;= [B,X^'^’'^) is the unique (up to indistinguishability) 

continuous and F'^-progressively measurable process in (f/jX^jP) such that, for all 
9 in M+, 





6*’'^(s,y*’X.^^)c[s + 


d^'’'{s,Y^'^’’^)dBs 


F-a.s. (4) 


Dehne pathwise uniqueness and uniqueness in law for standard SDEs as, respec¬ 
tively, in Rogers and Williams [TSl Def.V.9.4] and [181 Def.V.16.3]). In view of the 
celebrated Yamada and Wanabe’s Theorem (see, e.g., |181 Thm.V.17.1]), the former 
implies the latter and the SDE (|4]) satishes uniqueness in law. 
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Remark 2.1. (i) The stochastic integrals and the solution to the SDE ([2]) are defined 
as continuous and adapted to the augmented filtration 

(ii) In an abstract probability space equipped with a standard Brownian motion W, 
our formulation is equivalent to choosing controls as predictable processes w.r.t. 
the filtration generated by W. Indeed such a process can always be represented as 
i^{W.) for some u in LI (see Proposition below). This fact plays a crucial role 
in our justification of the pseudo-Markov property for controlled diffusions. In their 
analyses of stochastic control problems, Krylov m or Fleming and Soner m do not 
need to use this property and deal with controls adapted to an arbitrary filtration. 

(iii) We could have defined controls as -predictable processes since any -predictable 
process is indistinguishable from an ¥-predictable process (see Dellacherie and Meyer Q 
Thm.IV.78 and Rk.IV.Vf]). Notice also that the notions of predictable, optional and 
progressively measurable process w.r.t. ¥ coincide (see Proposition\3.4\ below). 


2.2 A pseudo-Markov property for controlled diffusion 
processes 


Before formulating our main result, we introduce the class of the shifted control 
processes constructed by concatenation of paths: for all u in U and (t, w) in M"*" x O 
we set 

{uj) ■.= V(s, w) G M"*" X O, 

where w (8>t cu is the function in O defined by 


(w (?)* a;)(s) 


w(s), if 0 < s < t, 

w(t) + ti;(s) — uj{t), if s > t. 


Our main result is the following pseudo-Markov property for controlled diffusion 
processes. As already mentioned, it is most often considered as classical or obvious; 
however, to the best of our knowledge, its precise statement and full proof are 
original. 


Theorem 2.2. Let <1> : —)• M"*" be a bounded Borel measurable function and let J 

be defined as 


J{t,x,i^) := V(t, X, i/) G M+ X X ZY. 

Under Condition m, for all (t, X, u) G M"*" X Ll xU and ¥^ -stopping time r taking 
values in [t, -t-oo) we have 




F{duj)-a.s. (5) 


Remark 2.3. (i) Our formulation slightly differs from Fleming and Souganidis m 
who consider deterministic times t = s and conditional expectations given the non- 
augmented a-field Pg- 

(ii) We work with the augmented a-fields in order to make the solutions 

adapted. 

(iii) One motivation to consider stopping times r rather than deterministic times is 

that, to study viscosity solutions to Hamilton-Jacobi-Bellman equations, one often 
uses first exit times of from Borel subsets ofW^. 

(iv) It is not clear to us whether the function J is measurable w.r.t. all its arguments. 
However Theorem \2f^ implies that the r.h.s. of (© is a -measurable random 
variable. 
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In Section 12.31 we discuss technical subtleties hidden in Equality ([5]) and point out 
some of the difficulties which motivate us to propose a detailed proof. Before further 
discussions, we show how the pseudo-Markov property is used to prove parts of the 
DPP. 

Consider the value function 


V{t,x) := supE’^ , V(t,x) G M+ x D. (6) 

udU 

Proposition 2.4. (i) For all (t,x) in x D, it holds that 

V{t,x) = sup E^ , (7) 

Atew* 

where lA^ denotes the set of all n gU which are independent of Ft under P. 

(ii) Suppose in addition that the value function V is measurable. Then, for all 
(t,x) G M+ X D, e > 0, there exists v G lA such that for all -stopping times r 
taking values in [t,oo), one has 

P(t,x)-e < E^[E(r,[X*’"’"]J] . (8) 

Proof, (i) For ([7|), it is enough to prove that 

V{t,x) < sup E^ [4>(X*’^’^)] , 

since the other inequality results from lA^ C lA. Now, let v be an arbitrary control 
in lA. Apply Theorem 12.21 with t = t. It comes: 

J{t,x,v) = f J [t,[x]t,i'^’‘^)F{duj) = f J [t,x,n^’‘^)F{dco). 

Jgl J u 

Then ([7]) follows from the fact that, for all fixed w G D, the control belongs to 
UK 

(ii) Notice that, for all e > 0, one can choose an e-optimal control v, that is, 

V{t,x) — s < J{t,x,v). 

Apply Theorem 12.21 with this control n. It comes: 


V{t,x)-e < / < E^ [p (r, 


□ 

Remark 2.5. Inequality ([5]) is the ‘easy’ part of the DPP. Equality ([7]), combined 
with the continuity of the value function, is a key step in classical proofs of the 
‘diffieult’ part of the DPP, that is: for all control v in U and all -stopping time 
T taking values in [t,oo), one has 


V{t,x) > E^[e(t, 
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2.3 Discussion on Theorem 12.2 

On the hypothesis on b and a. In Theorem 12.21 the coefficients b and a 
are supposed to satisfy Condition ([1]) and thus the solutions to the SDE ([1|) are 
pathwise unique. However, we only need uniqueness in law in our second proof 
of Theorem 12.21 which is based on controlled martingale problems related to the 
SDE Q. Hence Condition ([T|) can be relaxed to weaker conditions which imply 
weak uniqueness. However we have not followed this way here to avoid too heavy 
notations and to remain within the classical family of controlled SDEs with pathwise 
unique solutions. 


Intuitive meaning and measurability issues. The intuitive meaning of 
Theorem 12.21 is as follows. Notice that satisfies: for all 6 G M+, 




X 


tAO 


rve 


rrve 


+ / b{s,x^'^''' ,i's)ds+ / cr{s,x^’^’’' ,i's)dBs, 


P —a.s. (9) 


If a regular conditional probability (r.c.p.) (PwjW G D) of P given were to exist 
(see, e.g., Karatzas and Shreve m Sec.5.3.C]), then Equation ([9]) would be satisfied 
Pw^almost surely and, in addition, we would have 


Pw(r = r(w),[X*’"’"]^= = (10) 

Hence, Theorem 12.21 would follow from the uniqueness in law of solutions to Equa¬ 
tion Unfortunately the situation is not so simple for the following reasons. 

First, since is complete, a r.c.p. of P given X^ does not exist (see [H Thin. 10]). 
Second, even if r were a F-stopping time and (Pw, w G D) were a r.c.p. of P given 
Xr, then Pw would be defined as a measure on X whereas is adapted to the 

P-augmented filtration F^: hence. Equality f]10p needs to be rigorously justified. 
Third, one needs to check that the stochastic integral in ([9|), which is constructed 
under P, agrees with the stochastic integral constructed under Pw for P-a.a. w. 


3 Technical Lemmas 

In order to solve the measurability issues mentioned above, we establish three tech¬ 
nical lemmas which, to the best of our knowledge, are not available in the literature. 

The first lemma improves the classical Dynkin theorem which states that, given 
an arbitrary probability space and arbitrary filtration (T-Lt), a stopping time w.r.t. 
to the augmented filtration of (T-Lt) is a.s. equal to a stopping time w.r.t. 
see, e.g., m Thin.11.75.3]. The improvement here is allowed by the fact that we 
are dealing with the augmented Brownian filtration (more generally, we could deal 
with the natural augmented filtration generated by a Hunt process with continuous 
paths: see Chung and Williams jS] Sec.2.3,p.30-32]). 

Lemma 3.1. Let t be a W^-stopping time. There exists a ¥-stopping time rj such 
that 

P(r = r?) = 1 and V . (11) 

Proof. First step. As F^ is the augmented Brownian filtration, all P'^-optional 
process is predictable and hence all F'^-stopping time is predictable (see, e.g., Revuz 
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and Yor [T6l Cor.V.3.3]). It follows from Dellacherie and Meyer [H Thm.IV.78] that 
there exists a F-stopping time (actually, a predictable time) r] such that 

P(r = rj) = 1. 

Second step. We now prove J-r/ V C First, since C we have 

J\f^ C . Second, t/ is a F'^-stopping time. Since t = rj, P — a.s. and are 

both P-complete, one has from which C 

Last step. It therefore only remains to prove C J-rj ^ Af^- Let A € J-^. In 

view of [H Thm.IV.64], there exists a F®”-optional (or equivalently F^-predictable) 

process X such that = Xr. By using Theorem IV.78 and Remark IV.74 in [ 4 ], 
there exists a F-predictable process Y which is indistinguishable from X. It follows 
that lyi = Xj- = Yt = Yjj, P — a.s., which implies that M G V Af^ since Y^i is 
V^-measurable. This ends the proof. □ 

The two next lemmas are used in Section|4|only. They concern the r.c.p. (Fw)werJ 
of P given a cr-algebra Q C X. They allow us to circumvent the difficulties raised 
by the fact that F^ is not a measure on X^. The key idea is to notice that, if N 
in X satisfies ¥{N) = 0, then there is a P-null set M €z X (depending on N) such 

that Pw(V) = 0 for all w G \ M. Hence any subset of N belongs to Af^ and, for 

all w G \ M, to the set Af^'" of all Pw-negligible sets of X. 

Lemma 3.2. Define := Q y Af^ and := Q y Af^'". Let E be a Polish space 

and ^ : Ll ^ E be a -random variable. Then 

(i) There exists a Q-random variable : Ll ^ E such that 

F{i=e) = i- 

(ii) For P-fl.fl. w G fl, ^ is a -random variable and 

Pw(C = e(w)) = 1. 

(hi) Let X^'" := X y Af^'" and C, be a ¥-integrable X^-random variable. Then for 
P-a. a. vj ^ Ll, C, is X^'"-measurable and 

E[c I ei^](w) = [ c(w')Pw(dw'). 

Jo, 

Proof, (i) As E is Polish, there exists a Borel isomorphism between E and a subset 
of M. This allows us to only consider real-valued random variables. A monotone 
class argument allows us to deal with random variables of the type Ic with G in 
from which the result (i) follows since 

= {g G X^-, 3G^ eg, GAG° G , (12) 

where A denotes the symmetric difference (see, e.g., Rogers and Williams m 
Ex.V.79.67a]). 

(ii) Let be a ^-random variable such that P(^ = ^*^) = 1. In other words, there is 
a F-null set V G such that = C(w) for all uj € Ll\N. Hence, for P-a.a. w, 

we have Pw(iV) = 0 and G A}A{^° e A} C N belongs to Af^'" for all Borel set 
A. It results from (]12p with P^ in place of P that ^ is ^'^™-measurable. Moreover, 
since is an ^-random variable taking values in a Polish space, we have 

Fw (C° = e°(w)) = 1, 
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for P-a.a. w £ O. Then, for all w £ O \ such that 


Pw(iV) = 0 and Pw = e°(w)) = 1, 


we have 

Pw(c = e(w)) = i. 

(hi) Using the same arguments as in the proof of (ii), we get that ^ is 
measurable. Now, let x be a bounded ^'^-measurable random variable. In view 
of (i), there exists a ^-measurable (resp. J^-measurable) xo (resp. Co) random 
variable such that X = Xo (resp. C = Co) P^a.s. Therefore, 


E'P [Cx] = [CoXo] 




E^Co 



Hence it holds 


E^[C I 6^’"](w) = / Co(w')Pw(dw'), 
Jn 


P(dw) — a.s. 


We then conclude by using that C is -measurable and Pw(C = Co) = 1 for P-a.a 
w £ U. 

□ 


Lemma 3.3. Let F'*'™ be the ¥^-augmented filtration o/F. Let E he a Polish space 
and Y : M+ x Ll ^ E be a -predictable process. Then, for P-a.a. \n ^ Ll, Y is 
-predictable. 

Proof. As already noticed in the proof of Lemma 13.11 there exists a F-predictable 
process indistinguishable from Y under P. The result thus follows from our first 
arguments in the proof of Lemma 13.21 (ii). □ 

We end this section by two propositions which will not be used in the proof of 
the main results but were implicitly used in the formulation of stochastic control 
problems in Section [2.II 

The proposition below ensures that classical notions of measurability coincide 
for processes defined on the canonical space Li. This is no longer true when LI is the 
space of cadlag functions. For a precise statement, see Dellacherie and Meyer 
Thm.IV.97]. 

Proposition 3.4. Let E be a Polish space and Y : M"*" x LI ^ E be a stochastic 
process. Then the following statements are equivalent : 

(i) Y is ¥-predictable. 

(ii) Y is ¥-optional. 

(iii) Y is ¥-progressively measurable. 

(iv) Y is (S' -measurable and ¥-adapted. 

(v) Y is H(M+) lYi iP-measurable and satisfies 

Yfiuj) = n([a;].), V(s,a;) £M+X U. 

Proof. It is well known that (i) (ii) => (Hi) => {iv). Now we show that (iv) 

(v). Remember that Eg = c’'([-]s : ui i—[taj^) for all s £ Since Yg is Pg- 

measurable and E is Polish, Doob’s functional representation Theorem (see, e.g., 
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Kallenberg [111 Lem.1.13]) implies that there exists a random variable Zg such that 
Ys{u)) = Zs{\uj]s) for all a; G H. By noticing that [-j^ o [-j^ = [-j^, we deduce that 

y,([a;],) = Zg{[uj]g) = Ygico), V(s,w) G M+ x H. 

To conclude it remains to prove that (v) (i). Clearly it is enough to show that 

[•] : {s,uj) is a predictable process. In other words, we have to show that 

for any C (z T, [•]~^(C') is a predictable event. When C is of the type 
with t G M+ and A G the result holds true since Bt o [•] : (s,a;) uj{t A s) 

is predictable as a continuous and adapted M'^-valued process. We conclude by 
observing that the former events generate the cj-algebra B. □ 

Now, let be an abstract measurable space equipped with a measurable, 

continuous process X* = Denote by F* = {X*)t>o the filtration generated 

by X*, i.e. X: = s < t}. 

Proposition 3.5. Let E be a Polish space. Then a process Y : M+ X Q* ^ E is 
¥* -predictable if and only if there exists a process 4> : M"*" x Ll ^ E defined on the 
canonical space D and ¥-predictable such that 

Ytiuj*) = (j){t,X*{uj*)) = (j){t,[X\{u;*)), y{t,uj*) xQ*. (13) 

Proof. First step. Let : R+ x D —be a F-predictable process on the canonical 
space. Notice that {t,ui*) i— >■ t and {t,UJ*) i— >■ [X*]t(a;*) are both F*-predictable. It 
follows that {t,oj*) I—>• Yt{ijj*) := (p{t, [X*]t(u;*)) is also F*-predictable. 

Second step. We now prove the converse relation. Suppose that T is a F*- 
predictable process of the type Kj(w*) = where 0 < < t 2 and 

A G Jy* . There exists C in X such that A = ([X*]t^)“^(C'), so that (I13p holds true 
with 

^(s,w) :=I(q,t2]xC7('S, [wjtj. 

In view of Proposition 13.41 it is clear that the above cj) is F-predictable. The same 
arguments show that (I13p holds also true if Y is of the form Ys{co*) = I{o}xA with 
A G Xq. Notice that the predictable ir-field on M'*' x Ll* w.r.t. F* is generated by 
the collection of all sets of the form {0} x A with A G Xq and of the form {ti,t 2 ] x A 
with 0 < ti < t 2 and A G T)* (see, e.g., [H Thm.IV.64]). It then remains to use the 
monotone class theorem to conclude. □ 


4 A first proof of Theorem 12.2 


In this section we develop the arguments sketched by Fleming and Souganidis [9] in 
the proof of their Lemma 1.11. 

Let T be a F'^-stopping time taking values in [t, +oo). We have 


- ^tas + 


rT\/0 


b{s, i's)ds + 


rT\/0 


a 


{s,X^’^’'^,Us)dBs, V0>O, P-a.s. 


Now, it follows from Lemma 13.II that there is a F-stopping time rf such that m 
holds true. Let (Pw)werJ ^ family of r.c.p. of P given Xrj. In view of Lemma [3.21 
for P-a.a. w G D, 




$ (Xb’^-^) Xf 1 (w) = E' 


(Xb^’^)] 


(14) 
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and 


(15) 


Pw(T = r(w), = [^‘’"’1r(w), 1 / = = 1. 

In view of Lemma [4.II below we have, for P-a.a. w G fl, 


-,e] 




a 


r(w),0] 


,Us)dBs, V6'>r(w), P^ - a.s., 


where the l.h.s. (resp. r.h.s.) term denotes the stochastic integral constructed under 
P (resp. Pw). It follows from (fTSl) that, for P-a.a. w G 11, 


^9 = Ka9 ( w ) + 


/*t(w)V0 
' r(w) 


6(s,X'’^’^i/jW’'")ds 


^t(w)V 0 

+ / V0>O, Pw-a.s. 

J r(w) 


Notice that, by Lemma 1^31 is -adapted, for P-a.a. w G 11. Hence, 

Xbx.i' jg solution of SDE Q with initial condition (t(w), [X*’^’‘^]t-(w)) and 
control in (H,F'^'",Pw) for P-a.a. w G H. As the SDE (H]) satisfies uniqueness 

in law, the law of under P^ coincides with the law of 

under P. We then conclude the proof by using (|14p . □ 

Lemma 4.1. Let H be a -predictable proeess such that 

r9 


{Hsfds < + 00 , V0 > 0, P — a.s. 


/o 


Then, using the same notation as above for the stochastic integrals, we have: For 
P-a.a. w G D, HgdBg is -measurable and 

fV /-Pw 

/ HgdBs = / HgdBs, V0>t(w), P^ — a.s. (16) 

■/[r,0] </[r(w),0] 

Proof. In this proof we implicitly use Lemma 13.31 to consider F^-predictable pro¬ 
cesses, such as the stochastic integrals defined under P, as F®”™-predictable processes 
for P-a.a. w G D. In particular, the l.h.s. of (|16p is -measurable for P-a.a. 
w G D. 

A standard localizing procedure allows us to assume that 




■ ^+oo 

/ {Hsf 

Jo 


ds 


< + 00 . 


Now let be a sequence of simple processes such that 


lim E^ 
n—>-oo 


= 0 . 


/‘+00 , .2 

- Hs) ds 

We then write 

rP rPw rP rPw 

/ HsdBs - / HsdBs = / H^^^dBs - / 

+ J^iHs - Hi^^)dBs - - Hi^^)dBs 


( 17 ) 
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and notice that the first difference is null since the stochastic integral is defined 
pathwise when the integrand is a simple process. 

By taking conditional expectations in (1171) , we get that there exists a subsequence 
such that 

= 0 , 

for P-a.a. w G Id. Hence, by Doob’s inequality and Ito’s isometry, 


lim 

n^oo 


/*+oo . . < 


ds 


lim E’'"™ 

n—>00 


sup 

0>r(w) 





[t(w),0] 

To conclude, it thus suffices to prove that 



= 0 . 


lim E"^™ 

n—)-oo 



= 0. 


for P-a.a. w G H. We cannot use Doob’s inequality and Ito’s isometry without care 
because the stochastic integrals are built under P and the expectation is computed 
under Pw However we have 


lim E'^ 

n —>-00 


sup 

e>o 



= 0 . 


Thus we can proceed as above by taking conditional expectation and extracting a 
new subsequence. That ends the proof. □ 


Remark 4.2. To avoid the technicalities of this first proof of Theorem \2.2l a natural 
attempt consists in solving the equations m in the space H equipped with the non 
augmented filtration. This may be achieved by following Stroock and Varadhan’s 
approach m to stochastic integration. This way leads to right-continuous and only 
P-a.s continuous stochastic integrals and solutions. As they are not valued in a 
Polish space, new delicate technical issues arise: for example, m and (USD need to 
be revisited. 


5 


A second proof of Theorem 




In this section we provide a second proof of Theorem 12.21 Compared to the above 
first proof, the technical details are lighter. However we emphasize that it uses that 
weak uniqueness for Brownian SDEs is equivalent to uniqueness of solutions to the 
corresponding martingale problems. Therefore it may be more difficult to extend 
this second methodology than the first one to stochastic control problems where the 
noise involves a Poisson random measure. 

Our second proof of Theorem 12.21 is based on the notion of controlled martingale 
problems on the enlarged canonical space H := Denote by F = {J-t)t>o the 
canonical filtration, and by [B,X) the canonical process. 

Fix (t, v) G M'*" X U. Let 6*’^ and be defined as in Section 12.11 For all 
functions y? in let the process {Mf ’ , 0 > t) be defined on the enlarged 

space D by 

Cl’’^ip{Cd)ds, 9 >t, (18) 


: = 


(p{uje) - 


I 


11 











where is the differential operator 

■■= b^’'"{s,u) ■ DifiYs) + ^ a^’''is,u) : D^(p{uJs)-, 

here, we set a^’‘^{s,Cd) := a})d'*’^(s,w)* and the operator is defined by 

p : q := Ti(pq*) for all p and q in S 2 d,d- It is clear that the process M is 
F-progressively measurable. 

Let (t, X, u) G M"*" xQxU. Denote by P the probability measure on D induced 
by (B, under the Wiener measure P. For all 99 in the process M 

is a martingale under P and 

^x(s)^ 0 < s < t) = 1 . 

Let r be a F'^-stopping time taking values in [t, + 00 ) and let p be the F-stopping 
time defined in Lemma 13.II Set h(w) := z^(w) and r/(w) := r/(w) for all w = (w, w') G 
D. It is clear that 77 is a F-stopping time. Then there is a family of r.c.p. of P 
given Tfj denoted by and a P*’^’^-null set N C Q such that 

{v ='il(y^),Bs = w{s),Xs = w'{s),0 < s <p{w)) = 1 ( 19 ) 

for all w = (w, W) G D \ A^. In particular, one has 

= Vs>0) = 1 


for all w = (w, w') G Q\N. Moreover, Lemma 6.1.3 in |19| combined with a standard 
localization argument implies that for P ’^’^-a.a. w G D, for all (p G the 

process 

j-e 

ifYe) - / CYp{u;)ds, 9 > p{\n), 

J 7?(w) 

is a martingale under P,^ ’ ■ It follows by (|19l) that for P ’ ’ -a.a. w G D, for all 

(f G C'^(R^'^), is a martingale under 

As weak uniqueness is equivalent to uniqueness of solutions to martingale prob- 
lerr0, for P*’^’^-a.a. w G D, P^^’^ coincides with the probability measure on Q 
induced by (w(8),j(w) il, under the Wiener measure P. Therefore, for 

all bounded random variable Y in we have 


pP Y] = [_ ($(w')T(w)) P*’'‘’''(dw) 

Jn 


' [$(x)] y(w)) P^’''’''((iw) 


= J_ y(w)) p*’^’^((iw) 

= ^ (J ( 77 ( 0 ;), y(a;)) P(da;). 

Since r/ = r P — a.s., we have completed the proof. 


□ 


^ Here the SDE to consider is : for all 0 G M"*", 


Zg = w(?7(w) A 


rr}{w)v9 

o) + 

r}{w) 






pri{w)ye 

(s,Z)ds+ / "(s,y)dH,. 

J ri{w) 
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